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METHOD AND SYSTEM FOR AUTOMATED FOCUS-DISTANCE 
DETERMINATION FOR MOLECULAR ARRAY SCANNERS 



TECHNICAL FIELD 

5 The present invention relates to the molecular array scanners and, in 

particular, to automated adjustment of the focal distance at which a sample molecular 
array is positioned with respect to laser light sources and photodetectors to produce 
optimal data acquisition. 

1 0 BACKGROUND OF THE INVENTION 

The present invention is related to acquisition of molecular-array data 
{ and other types of genetic, biochemical, and chemical data from molecular arrays by 



fll molecular array scanners. A general background of molecular-array technology is 

first provided, in this section, to facilitate discussion of the scanning techniques 



Lsj 15 described in following sections. 

3 

« Array technologies have gained prominence in biological research and 

U are likely to become important and widely used diagnostic tools in the healthcare 

industry. Currently, molecular-array techniques are most often used to determine the 
O concentrations of particular nucleic-acid polymers in complex sample solutions. 

20 Molecular-array-based analytical techniques are not, however, restricted to analysis 
of nucleic acid solutions, but may be employed to analyze complex solutions of any 
type of molecule that can be optically or radiometrically scanned and that can bind 
with high specificity to complementary molecules synthesized within, or bound to, 
discrete features on the surface of an array. Because arrays are widely used for 
25 analysis of nucleic acid samples, the following background information on arrays is 
introduced in the context of analysis of nucleic acid solutions following a brief 
background of nucleic acid chemistry. 

Deoxyribonucleic acid ("DNA") and ribonucleic acid ("RNA") are 
linear polymers, each synthesized from four different types of subunit molecules. 
30 The subunit molecules for DNA include: (1) deoxy-adenosine, abbreviated "A," a 
purine nucleoside; (2) deoxy-thymidine, abbreviated "T," a pyrimidine nucleoside; 
(3) deoxy-cytosine, abbreviated "C," a pyrimidine nucleoside; and (4) deoxy- 
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guanosine, abbreviated "G," a purine nucleoside. The subunit molecules for RNA 
include: (1) adenosine, abbreviated "A," a purine nucleoside; (2) uracil, abbreviated 
"U," a pyrimidine nucleoside; (3) cytosine, abbreviated "C," a pyrimidine nucleoside; 
and (4) guanosine, abbreviated "G," a purine nucleoside. Figure 1 illustrates a short 
5 DNA polymer 100, called an oligomer, composed of the following subunits: (1) 
deoxy-adenosine 102; (2) deoxy-thymidine 104; (3) deoxy-cytosine 106; and (4) 
deoxy-guanosine 108. When phosphorylated, subunits of DNA and RNA molecules 
are called "nucleotides" and are linked together through phosphodiester bonds 110- 
1 15 to form DNA and RNA polymers. A linear DNA molecule, such as the oligomer 
10 shown in Figure 1, has a 5 f end 118 and a 3' end 120. A DNA polymer can be 
chemically characterized by writing, in sequence from the 5* end to the 3 f end, the 
C3 single letter abbreviations for the nucleotide subunits that together compose the DNA 

m polymer. For example, the oligomer 100 shown in Figure 1 can be chemically 

^ represented as "ATCG." A DNA nucleotide comprises a purine or pyrimidine base 

yj 15 (e.g. adenine 122 of the deoxy-adenylate nucleotide 102), a deoxy-ribose sugar (e.g. 

" deoxy-ribose 124 of the deoxy-adenylate nucleotide 102), and a phosphate group (e.g. 

jj phosphate 126) that links one nucleotide to another nucleotide in the DNA polymer. 

In RNA polymers, the nucleotides contain ribose sugars rather than deoxy-ribose 

yj 

O sugars. In ribose, a hydroxyl group takes the place of the 2' hydrogen 128 in a DNA 



20 nucleotide. RNA polymers contain uridine nucleosides rather than the deoxy- 
thymidine nucleosides contained in DNA. The pyrimidine base uracil lacks a methyl 
group (130 in Figure 1) contained in the pyrimidine base thymine of deoxy- 
thymidine. 

The DNA polymers that contain the organization information for 
25 living organisms occur in the nuclei of cells in pairs, forming double -stranded DNA 
helixes. One polymer of the pair is laid out in a 5 1 to 3' direction, and the other 
polymer of the pair is laid out in a 3* to 5' direction. The two DNA polymers in a 
double-stranded DNA helix are therefore described as being anti-parallel. The two 
DNA polymers, or strands, within a double-stranded DNA helix are bound to each 
30 other through attractive forces including hydrophobic interactions between stacked 
purine and pyrimidine bases and hydrogen bonding between purine and pyrimidine 
bases, the attractive forces emphasized by conformational constraints of DNA 
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polymers. Because of a number of chemical and topographic constraints, double- 
stranded DNA helices are most stable when deoxy-adenylate subunits of one strand 
hydrogen bond to deoxy-thymidylate subunits of the other strand, and deoxy- 
guanylate subunits of one strand hydrogen bond to corresponding deoxy-cytidilate 
subunits of the other strand. 

Figures 2A-B illustrate the hydrogen bonding between the purine and 
pyrimidine bases of two anti-parallel DNA strands. Figure 2A shows hydrogen 
bonding between adenine and thymine bases of corresponding adenosine and 
thymidine subunits, and Figure 2B shows hydrogen bonding between guanine and 
cytosine bases of corresponding guanosine and cytosine subunits. Note that there are 
two hydrogen bonds 202 and 203 in the adenine/thymine base pair, and three 
hydrogen bonds 204-206 in the guanosine/cytosine base pair, as a result of which GC 
base pairs contribute greater thermodynamic stability to DNA duplexes than AT base 
pairs. AT and GC base pairs, illustrated in Figures 2A-B, are known as Watson- 
Crick ("WC") base pairs. 

Two DNA strands linked together by hydrogen bonds forms the 
familiar helix structure of a double-stranded DNA helix. Figure 3 illustrates a short 
section of a DNA double helix 300 comprising a first strand 302 and a second, anti- 
parallel strand 304. The ribbon-like strands in Figure 3 represent the deoxyribose and 
phosphate backbones of the two anti-parallel strands, with hydrogen-bonding purine 
and pyrimidine base pairs, such as base pair 306, interconnecting the two strands. 
Deoxy-guanylate subunits of one strand are generally paired with deoxy-cytidilate 
subunits from the other strand, and deoxy-thymidilate subunits in one strand are 
generally paired with deoxy-adenylate subunits from the other strand. However, non- 
WC base pairings may occur within double-stranded DNA. 

Double-stranded DNA may be denatured, or converted into single 
stranded DNA, by changing the ionic strength of the solution containing the double- 
stranded DNA or by raising the temperature of the solution. Single-stranded DNA 
polymers may be renatured, or converted back into DNA duplexes, by reversing the 
denaturing conditions, for example by lowering the temperature of the solution 
containing complementary single-stranded DNA polymers. During renaturing or 
hybridization, complementary bases of anti-parallel DNA strands form WC base pairs 
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in a cooperative fashion, leading to reannealing of the DNA duplex. Strictly A-T and 
G-C complementarity between anti-parallel polymers leads to the greatest 
thermodynamic stability, but partial complementarity including non-WC base pairing 
may also occur to produce relatively stable associations between partially- 
complementary polymers. In general, the longer the regions of consecutive WC base 
pairing between two nucleic acid polymers, the greater the stability of hybridization 
between the two polymers under renaturing conditions. 

The ability to denature and renature double-stranded DNA has led to 
the development of many extremely powerful and discriminating assay technologies 
for identifying the presence of DNA and RNA polymers having particular base 
sequences or containing particular base subsequences within complex mixtures of 
different nucleic acid polymers, other biopolymers, and inorganic and organic 
chemical compounds. One such methodology is the array-based hybridization assay. 
Figures 4-7 illustrate the principle of the array-based hybridization assay. An array 
(402 in Figure 4) comprises a substrate upon which a regular pattern of features are 
prepared by various manufacturing processes. The array 402 in Figure 4, and in 
subsequent Figures 5-7, has a grid-like two-dimensional pattern of square features, 
such as feature 404 shown in the upper left-hand corner of the array. It should be 
noted that many molecular arrays contain disk-shaped features, rather than round 
features. Each feature of the array contains a large number of identical 
oligonucleotides covalently bound to the surface of the feature. These bound 
oligonucleotides are known as probes. In general, chemically distinct probes are 
bound to the different features of an array, so that each feature corresponds to a 
particular nucleotide sequence. In Figures 4-6, the principle of array-based 
hybridization assays is illustrated with respect to the single feature 404 to which a 
number of identical probes 405-409 are bound. In practice, each feature of the array 
contains a high density of such probes but, for the sake of clarity, only a subset of 
these are shown in Figures 4-6. 

Once an array has been prepared, the array may be exposed to a 
sample solution of target DNA or RNA molecules (410-413 in Figure 4) labeled with 
fluorophores, chemiluminescent compounds, or radioactive atoms 415-418. Labeled 
target DNA or RNA hybridizes through base pairing interactions to the 
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complementary probe DNA, synthesized on the surface of the array. Figure 5 shows 
a number of such target molecules 502-504 hybridized to complementary probes 
505-507, which are in turn bound to the surface of the array 402. Targets, such as 
labeled DNA molecules 508 and 509, that do not contains nucleotide sequences 
5 complementary to any of the probes bound to array surface, do not hybridize to 
generate stable duplexes and, as a result, tend to remain in solution. The sample 
solution is then rinsed from the surface of the array, washing away any unbound 
labeled DNA molecules. Finally, the bound labeled DNA molecules are detected via 
optical or radiometric scanning. Figure 6 shows labeled target molecules emitting 

10 detectable fluorescence, radiation, or other detectable signal. Optical scanning 
involves exciting labels of bound labeled DNA molecules with electromagnetic 
radiation of appropriate frequency and detecting fluorescent emissions from the 
labels, or detecting light emitted from chemiluminescent labels. When radioisotope 
labels are employed, radiometric scanning can be used to detect the signal emitted 

15 from the hybridized features. Additional types of signals are also possible, including 
electrical signals generated by electrical properties of bound target molecules, 
magnetic properties of bound target molecules, and other such physical properties of 
bound target molecules that can m ^^^^— produce a detectable 

signal. Optical, radiometric, or other types of 

scanning produce an analog or digital representation 

25 of the array as shown in Figure 7, ^^^^^^^^^1 features to which 

labeled target molecules are I^^^^^^^^^^^^^m hybridized similar to 
706 optically or digitally W^^B^ml^mmB^BS differentiated from 

30 those features to which no labeled DNA molecules are bound. In other words, the 
analog or digital representation of a scanned array displays positive signals for 
features to which labeled DNA molecules are hybridized and displays negative 
features to which no, or an undetectably small number of, labeled DNA molecules are 
bound. Features displaying positive signals in the analog or digital representation 

35 indicate the presence of DNA molecules with complementary nucleotide sequences in 
the original sample solution. Moreover, the signal intensity produced by a feature is 
generally related to the amount of labeled DNA bound to the feature, in turn related 
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to the concentration, in the sample to which the array was exposed, of labeled DNA 
complementary to the oligonucleotide within the feature. 

Array-based hybridization techniques allow extremely complex 
solutions of DNA molecules to be analyzed in a single experiment. An array may 
5 contain from hundreds to tens of thousands of different oligonucleotide probes, 
allowing for the detection of a subset of complementary sequences from a complex 
pool of different target DNA or RNA polymers. In order to perform different sets of 
hybridization analyses, arrays containing different sets of bound oligonucleotides are 
manufactured by any of a number of complex manufacturing techniques. These 
10 techniques generally involve synthesizing the oligonucleotides within corresponding 
« features of the array through a series of complex iterative synthetic steps, or 

C3 depositing oligonucleotides isolated from biological material. 

g3 

fj !! j As pointed out above, array-based assays can involve other types of 

a 2 biopolymers, synthetic polymers, and other types of chemical entities. For example, 

=£113 

LJ 15 one might attach protein antibodies to features of the array that would bind to soluble 

IL labeled antigens in a sample solution. Many other types of chemical assays may be 

rjj facilitated by array technologies. For example, polysaccharides, glycoproteins, 

S synthetic copolymers, including block copolymers, biopolymer-like polymers with 

O synthetic or derivitized monomers or monomer linkages, and many other types of 

20 chemical or biochemical entities may serve as probe and target molecules for array- 
based analysis. A fundamental principle upon which arrays are based is that of 
specific recognition, by probe molecules affixed to the array, of target molecules, 
whether by sequence-mediated binding affinities, binding affinities based on 
conformational or topological properties of probe and target molecules, or binding 
25 affinities based on spatial distribution of electrical charge on the surfaces of target 
and probe molecules. 

An "array", unless a contrary intention appears, includes any one, two 
or three dimensional arrangement of addressable regions bearing a particular 
chemical moiety to moieties (for example, biopolymers such as polynucleotide 
30 sequences) associated with that region. An array is "addressable" in that it has 
multiple regions of different moieties (for example, different polynucleotide 
sequences) such that a region (a "feature" or "spot" of the array) at a particular 
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predetermined locarion (an "address") on the array will detect a particular target or 
class of targets (although a feature may incidentally detect non-targets of that 
feature). Array features are typically, but need not be, separated by intervening 
spaces. In the case of an array, the "target" will be referenced as a moiety in a mobile 
5 phase (typically fluid), to be detected by probes ("target probes") which are bound to 
the substrate at the various regions. However, either of the "target" or "target probes" 
may be the one which is to be evaluated by the other (thus, either one could be an 
unknown mixture of polynucleotides to be evaluated by binding with the other). An 
"array layout" refers collectively to one or more characteristics of the features, such 
10 as feature positioning, one or more feature dimensions, and the chemical moiety or 
m mixture of moieties at a given feature. "Hybridizing" and "binding", with respect to 

w polynucleotides, are used interchangeably. 

CO 

fj^ Any given substrate may carry one, two, four or more or more arrays 

2 disposed on a front surface of the substrate. Depending upon the use, any or all of the 

J 1 5 arrays may be the same or different from one another and each may contain multiple 

* spots or features. A typical array may contain more than ten, more than one hundred, 

FU more than one thousand more ten thousand features, or even more than one hundred 

thousand features, in an area of less than 20 cm 2 or even less than 10 cm 2 . For 
example, features may have widths (that is, diameter, for a round spot) in the range 
20 from a 1 0 jjm to 1 .0 cm. In other embodiments each feature may have a width in the 
range of 1.0 to 1.0 mm, usually 5.0 pm to 500 urn, and more usually 10 (am to 
200 jim. Non-round features may have area ranges equivalent to that of circular 
features with the foregoing width (diameter) ranges. At least some, or all, of the 
features may be of different compositions (for example, when any repeats of each 
25 feature composition are excluded the remaining features may account for at least 5%, 
10%, or 20% of the total number of features). Interfeature areas will typically (but 
not essentially) be present which do not carry any polynucleotide (or other 
biopolymer of a type of which the features are composed). Such interfeature areas 
typically will be present where the arrays are formed by processes involving drop 
30 deposition of reagents but may not be present when, for example, photolithographic 
array fabrication processes are used,. It will be appreciated though, that the 
interfeature areas, when present, could be of various sizes and configurations. 
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The array features can have widths (that is, diameter, for a round spot) 
in the range from a minimum of about 10 jxm to a maximum of about 1.0 cm. In 
embodiments where very small spot sizes or feature sizes are desired, material can be 
deposited according to the invention in small spots whose width is in the range about 
5 1.0 jam to 1.0 mm, usually about 5.0 \xm to 500 jxm, and more usually about 10 ^m to 
200 jam. Features which are not round may have areas equivalent to the area ranges 
of round features 16 resulting from the foregoing diameter ranges. 

Each array may cover an area of less than 1 00 cm 2 , or even less than 
50, 10 or 1 cm 2 . In many embodiments, the substrate carrying the one or more arrays 
10 will be shaped generally as a rectangular solid (although other shapes are possible), 
having a length of more than 4 mm and less than 1 m, usually more than 4 mm and 
less than 600 mm, more usually less than 400 mm; a width of more than 4 mm and 

03 

ffj less than 1 m, usually less than 500 mm and more usually less than 400 mm; and a 



c _ 



h Jl thickness of more than 0.01 mm and less than 5.0 mm, usually more than 0.1 mm and 

a 

yj 15 less than 2 mm and more usually more than 0.2 and less than 1 mm. With arrays that 

are read by detecting fluorescence, the substrate may be of a material that emits low 
r|j fluorescence upon illumination with the excitation light. Additionally in this 

m situation, the substrate may be relatively transparent to reduce the absorption of the 
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O incident illuminating laser light and subsequent heating if the focused laser beam 

fif 

20 travels too slowly over a region. For example, substrate 10 may transmit at least 
20%, or 50% (or even at least 70%, 90%, or 95%), of the illuminating light incident 
on the front as may be measured across the entire integrated spectrum of such 
illuminating light or alternatively at 532 nm or 633 nm. 

Once the labeled target molecule has been hybridized to the probe on 

25 the surface, the array may be scanned by an appropriate technique, such as by optical 
scanning in cases where the labeling molecule is a fluorophore or by radiometric 
scanning in cases where the signal is generated through a radioactive decay of labeled 
target. In the case of optical scanning, more than one fluorophore can be excited, 
with each different wavelength at which an array is scanned producing a different 

30 signal. In optical scanning, it is common to describe the signals produced by 
scanning in terms of the colors of the wavelengths of light employed for the scan. 
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For example, a red signal is produced by scanning the array with light having a 
wavelength corresponding to that of visible red light. 

Scanning of a feature by an optical scanning device or radiometric 
scanning device generally produces a scanned image comprising a rectilinear grid of 
pixels, with each pixel having a corresponding signal intensity. These signal 
intensities are processed by an array-data-processing program that analyzes data 
scanned from an array to produce experimental or diagnostic results which are stored 
in a computer-readable medium, transferred to an intercommunicating entity via 
electronic signals, printed in a human-readable format, or otherwise made available 
for further use. Molecular array experiments can indicate precise gene-expression 
responses of organisms to drugs, other chemical and biological substances, 
environmental factors, and other effects. Molecular array experiments can also be 
used to diagnose disease, for gene sequencing, and for analytical chemistry. 
Processing of molecular array data can produce detailed chemical and biological 
analyses, disease diagnoses, and other information that can be stored in a computer- 
readable medium, transferred to an intercommunicating entity via electronic signals, 
printed in a human-readable format, or otherwise made available for further use. 

Figure 8 illustrates components of a molecular array scanner. Lasers 
800a-b emit coherent light that passes through electro-optic modulators ("EOMs") 
810a-b with attached polarizers 820a-b. Each EOM and corresponding polarizer 
together act as a variable optical attenuator. A control signal in the form of a variable 
voltage is applied to each EOM 810a-b by controller 880. The controller 880 may 
include a suitably programmed processor, logic circuit, firmware, or a combination of 
software programs, logic circuits, and firmware. The control signal changes the 
polarization of the laser light, which alters the intensity of the light that passes 
through the EOM. In general, laser 800a provides coherent light of a different 
wavelength than that provided by laser 8 1 0b. For example, one laser may provide red 
light and the other laser may provide green light. The beams may be combined along 
a path toward a stage 800 by the use of full mirror 851 and dichroic mirror 853. The 
light from the lasers 800a-b is then transmitted through a dichroic beam splitter 854, 
reflected off fully reflecting mirror 856, and then focused, using optical components 
in beam focuser 860, onto a molecular array mounted on a holder 800. Fluorescent 
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Hi of the system during scanning. An autofocus system includes detector 870, processor 

gS 880, and a motorized adjuster to move the stage in the direction of arrow 896. 

" The controller 880 receives signals from photodetectors 850a-b, called 

fU 

20 "channels," corresponding to the intensity of the green and red fluorescent light 
emitted by probe labels excited by the laser light. The controller 880 also receives a 
signal from autofocus offset detector 870 in order to control stage adjustment, 
provides the control signal to the EOMs 810a-b, and controls the scan system. 
Controller 880 may also analyze, store, and output data relating to emitted signals 

25 received from detectors 850a-b. 

The dynamic autofocus mechanism, described above, provides a 
means for maintaining a constant focus distance during scanning of a molecular 
array. However, any of various different focus distances may be selected to be held 
constant during a scan. In general, the focus distance must correspond to the depth 

30 of focus of the molecular array optics, but, in general, there is one or a small range of 
optimal focus distances for a particular molecular array scanner. Designers, 



light, emitted at two different wavelengths (for example, green light and red light) 
from features of the molecular array in response to illumination by the laser light, is 
imaged using the optics in the focuser/scanner 860, and is reflected off mirrors 856 
and 854. The two different wavelengths are further separated by a dichroic mirror 
5 858 and are passed to photodetectors 850a-b. More optical components (not shown 
in Figure 8) may be used between the dichroic mirror and the photodetectors 850a-b, 
such as lenses, pinholes, filters, and fibers. The photodetectors 850a-b may be of 
various different types, including photo-multiplier tubes, charge-coupled devices, and 
avalanche photodiodes. 
10 A scan system causes a light spot from each laser 800a-b to be moved 

in a regular pattern about the surface of the molecular array. The molecular array is 
mounted to a stage that can be moved in horizontal and vertical directions to position 
light from the lasers onto a particular region at the surface of the molecular array, 
from which region fluorescent emission is passed back to the photodetectors via the 



yj 15 optical path described above. An autofocus detector 870 is provided to sense and 



correct any offset between different regions of the molecular array and the focal plane 
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manufacturers, and users of molecular arrays have recognized a need for an 
automated focus-distance optimization method. 

SUMMARY OF THE INVENTION 
5 One embodiment of the present invention provides an automated 

method for determining an optimal focus distance for scanning a molecular array by a 
molecular array scanner. Blocks of rows of a reference array are automatically 
scanned over a range of focus distances, or z-positions, to produce data providing a 
functional relationship between focus distance, or z-position, and measured signal 
10 intensities. The data is then processed by a peak-height-based, or local variation- 
based, focus-finding routine that selects an optimal focus distance for data scans. 

The present invention further provides a computer program product 
S for use with an apparatus sueh as described herein. The program product inelndes a 

Si 

computer readable storage medium having a computer program stored thereon and 
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UJ 15 which, when loaded into a programmable processor, provides instructions to the 

processor of that apparatus such that it will execute the procedures required of it to 
perform a method of the present invention. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
20 Figure 1 illustrates a short DNA polymer 100, called an oligomer, 

composed of the following subunits: (1) deoxy-adenosine 102; (2) deoxy- 
thymidine 104; (3) deoxy-cytosine 106; and (4) deoxy-guanosine 108. 

Figures 2A-B illustrate the hydrogen bonding between the purine and 
pyrimidine bases of two anti-parallel DNA strands. 
25 Figure 3 illustrates a short section of a DNA double helix 300 

comprising a first strand 302 and a second, anti-parallel strand 304. 

Figures 4-7 illustrate the principle of the array-based hybridization 

assay. 

Figure 8 is a block diagram of major optical and electronic 
30 components of a molecular array scanner. 



12 



Docket No.10020335-1 



Figure 9 abstractly illustrates the scanner components related to the 
automated focus-distance-determination method that represents one embodiment of 
the present invention. 

Figure 10 illustrates the effect of small changes in the z-position of an 
illuminated spot on the surface of a molecular array with respect to the image of the 
emitted light from the illuminated spot of the surface of the molecular array focused 
on the surface of an optical fiber. 

Figure 1 1 shows a representation of the function of the signal intensity 
/ measured by the photodetector of a molecular array scanner as a function of the z- 
position of an illuminated region of the molecular array that generates the signal. 

Figure 12 illustrates the variation in signal intensity / with z-position z 
in the vicinity of the peak of the \ z curve. 

Figure 1 3 illustrates selection of an optimal focus distance Z p . 

Figure 14 illustrates the signal intensity variation over a z-position 
variation centered about the peak of the I/z curve. 

Figure 15 illustrates the scan pattern employed in a preferred 
embodiment of the present invention. 

Figure 16 is a flow-control diagram for the optimal focus-distance 
determination method M focus_distance" that represents one embodiment of the 
present invention. 

Figures 17-21 illustrate operation of the routine 
"peakintensityjfocus" with reference to I/z functional curves. 

Figure 22 is a flow-control diagram for the routine 
"peakintensityfocus." 

Figures 23-29 illustrate operation of window focus using graphical 
representations of I/z curves. 

Figures 30-31 show flow-control diagrams that describe operation of 
window_focus. 

DETAILED DESCRIPTION OF THE INVENTION 

One embodiment of the present invention is a method for determining 
an optimal focus distance at which to scan the surface of a molecular array during 
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data collection from the molecular array by an automated molecular array scanner. 
The molecular array scanner includes dynamic autofocus functionality for 
maintaining the focus distance at a constant value during scanning of a molecular 

■ 

array. However, current scanners lack an automated focus-distance optimizing 
feature, and instead require a tedious and error-prone manual optimal focus-distance 
determination. The method for automated focus-distance determination that 
represents one embodiment of the present invention may be incorporated into logic 
circuits, firmware, or software, or a combination of logic circuits, firmware, and 
software, within the control logic of a molecular scanner. 

Figure 9 abstractly illustrates the scanner components related to the 
automated focus-distance determination method that represents one embodiment of 
the present invention. A molecular array 902 is mounted to the stage 904 of a 
molecular array scanner. The stage 904 can be translated vertically and horizontally 
in a plane coincident with the plane of the surface of the stage by X and Y translation 
mechanisms not shown in Figure 9. In addition, the distance of the stage 904 from 
the optical fiber or other light-gathering medium can be controlled by a distance- 
control mechanism that moves the stage and stage support 906 in a direction 
perpendicular to the plane of the scanner. 

Light from a laser 908, focused and filtered via various optic 
components 910, illuminates a small region of the array 912 from which light of a 
generally longer wavelength is emitted by fluorescent or chemiluminescent 
compounds incorporated into target molecules. The emitted light is filtered and 
focused by various optical components 914 and 916 into a roughly disk-shaped 
spot 918 that impinges on the surface of an optical fiber 920 or another light- 
collection medium that inputs emitted light from the surface of the molecular array 
into a photodetector 922. 

An orthogonal jc, y, z coordinate system 924 is shown near the left, 
lower corner of the molecular array-scanner stage 904 to indicate the directions in 
which the stage can be moved relative to the surface of the optical fiber 920 on which 
emitted light is focused. The exact functional relationship between variations in the 
distance from the emitted-light source to the surface of the optical fiber 920, closely 
related to the distance of the stage from the optical fiber 920, referred to as the z- 
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position of the stage, depends on the geometry of the laser, molecular array, and 
optical fiber geometries. However, the focus-distance is a continuous function of z- 
position. If the plane of the surface of the molecular array 902 is not parallel to the 
plane of the surface of the stage 904, then horizontal translations in the x, y plane of 
5 the stage may also affect the distance between light-emitting probe molecules on the 
surface of the molecular array and the optical fiber 920. The dynamic autofocus 
feedback-control mechanism within this scanner can dynamically adjust the z- 
position of the stage as the stage is translated in the jc, y plane in order to correct for 
the small focus-distance variation during a scan due to molecular-array-surface 
10 irregularities and positioning and orientation problems that cause the plane of the 
surface of the molecular array to be inclined with respect to the plane of the surface 
Q of the stage. Thus, when an optimal focus-distance is determined, a molecular array 

• * K 

can be scanned at this focus distance by the autofocus mechanisms within the 
scanner. Because the focus distance is a continuous function of the z-coordinate of 
J 15 the spatial location of the illuminated spot 912 of the molecular-array surface, which 

!L is, in turn, a continuous function of the z-position of the stage, the following 

Q 

fjj discussion is related to optimizing the z-position of the stage 904. It should be noted 

2 1 j 

11 that the following discussion and Figures could also be cast in terms of z-coordinate 

O of the laser- illuminated region of a molecular array, or the distance of either the stage 

ru 

20 or the laser-illuminated region of a molecular array from the emitted-light-collection 
medium. 

Figure 10 illustrates the effect of small changes in the z-position of the 
stage, or z-coordinate of an illuminated spot on the surface of a molecular array, with 
respect to the position of the image of the emitted light from the illuminated spot of 

25 the surface of the molecular array focused onto the surface of an optical fiber. This is 
one example of how the measured intensity of the emitted light can vary with changes 
in the focal distance. Another example is the loss of emission-intensity due to the 
lack of correct focus of the illuminated spot on the surface of the molecular array. 

In Figure 10, the circular cross section of the surface of an optical fiber 

30 (920 in Figure 9) is represented by circles 1002-1007. The image of an illuminated 
spot on the surface of the molecular array is shown in Figure 10 by various 
differently sized, cross-hatch-filled circles 1008-1013. In Figure 10, representations 
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1014-1019 of the focusing of the emitted light onto the plane of the circular cross 
section of the surface of the optical fiber are shown at the left. In each representation, 
a lens-like optic, e.g. lens-like optic 1020, focuses light to a point, e.g. point 1021, 
above, on, or below the plane, e.g. plane 1022, of the surface of the optical fiber. As 
5 the illuminated spot on the surface of the molecular array is moved in the z direction, 
the image of the illuminated spot focused onto the surface of the optical fiber, first 
defocused as illuminated spot 1008, focuses and decreases in diameter relative to the 
diameter of the cross section of the optical fiber. Illuminated spot 1010 represents a 
focused spot, corresponding to the optics representation 1016. As the illuminated 

10 spot on the surface of the molecular array is moved further in the z direction, the 
illuminated spot gradually defocuses and grows in diameter relative to the diameter of 
the cross section of the optical fiber. At either extreme of the z-position range, as 
indicated by optics representations 1014 and 1019, the image of the illuminated spot 
is sufficiently defocused that the image of the illuminated spot is larger than the cross 

15 section of the optical fiber, and emission photons representing a portion of the 
emitted-light are lost, resulting in a decrease in the intensity of the resulting measured 
signal. In Figure 10, the z-positions that result in the corresponding respective sizes 
of the image of the illuminated spot and the cross section of the optical fiber plotted 
along a coordinate axis 1024 corresponding to the z-position of the stage of the 

20 molecular scanner. The z-positions range from a lowest value, Z^ x , to a highest z 

value > z Lx • In Figure 10, for example, the z-position of the stage resulting in a 
centered, focused image of the illuminated spot on the surface of the optical 
fiber 1026 is shown with z coordinate 1028. In Figure 10, a vertical coordinate 
axis 1030 is shown orthogonal to the z-position coordinate axis 1024. This 

25 orthogonal coordinate axis corresponds to the intensity of the emitted light detected 
by a photodetector (922 in Figure 9) of the molecular array scanner. In Figure 10, the 
horizontal lines, such as horizontal line 1032, plotted along the vertical z-position axis 
represent the signal intensities measured for a particular region of the surface of the 
molecular array corresponding to the z-positions of the stage of the molecular array. 

30 Again, as the image of the illuminated region of the molecular array defocuses past 
the edges of the fiber, not all of the emitted light is captured by the fiber and, as a 
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result, the intensities 1034-1036 of the signal level decreases proportionally to the 
fraction of light that is missing the fiber. 

The graph of signal intensity to z-position, as shown in Figure 10, can 
be rotated 90 degrees and plotted as a continuous, rather than a discrete, function. 
5 Figure 1 1 shows a representation of the function of the signal intensity / measured by 
the photodetector of a molecular array scanner as a function of the z-position of an 
illuminated region of the molecular array that generates the signal. The function 
illustrated in Figure 11 1102 is exaggerated in order to clearly illustrate the general 
features of a typical signal-intensity-to-z-position ("I/z") relationship for one class of 
10 molecular array scanners. As in Figure 10, the functional relationship is plotted from 

Q a Z~^ x position to a Z^ x position along the z-position axis 1 1 04, with the height of 

the curve representing the measured signal intensity with respect to the signal- 

y ' intensity axis 1106. Proceeding rightward from the Z~ position 1108 towards to 

the Z^ x position 11 10, the signal intensity sharply rises to a signal intensity 

si? 

5 15 peak 1112 and then falls gradually to the edge 1114 of a steeply descending curve 

that falls back to nearly 0 intensity at Z^ x . In one class of molecular array scanners, 
the I/z function appears as a mesa, with the top of the mesa (section of the curve 



STVT 



J between 1112 and 1 1 14 in Figure 1 1) slanted slightly downward. 

fjj 

The autofocus mechanism within a molecular array scanner, as with 
20 all electromechanical devices, is not infinitely precise. Instead, the autofocus 
mechanism is capable of maintaining the distance of the illuminated portion of a 
molecular array surface within a small range of distances by controlling the z-position 
of the stage. Thus, despite the autofocus mechanism, the z-position, functionally 
related to the focus distance, varies slightly over the course of a molecular array scan. 
25 Figure 12 illustrates the variation in signal intensity / with z-position z in the vicinity 
of the peak of the / curve. In Figure 12, the z coordinate for the peak, z p , is plotted 

on the z axis 1202. The total variation in z-position during a scan is shown by the 
interval Az 1204 plotted from z, to z r on the z axis 1206-1207. The signal intensity 

varies over the interval Az 1204 by a corresponding signal intensity interval AT. Note 
30 that, for the typical I/z functional relationship illustrated in Figure 12, the signal 
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intensity varies over the entire AI interval 1208 within the portion of the Az interval to 
the left of the peak position Z p , while varying only relatively slightly in the portion 

of the Az interval to the right of the peak position Z p . Thus, because of the 

asymmetry of the I/z curve with respect to the peak position Z p , small fluctuations in 

5 the focus distance corresponding to Z p may result in relatively large variation in 

signal intensity. 

It is desirable to select a focus distance that provides a relatively slight 
variation in signal intensity corresponding to slight variations in focus distance during 
scanning of a molecular array. It is the goal of an autofocus mechanism to help 
1 0 maintain a constant system sensitivity for a constant quantity of flurophore over the 
entire surface of a molecular array. Figure 1 3 illustrates selection of an optimal focus 
distance Z f . As shown in Figure 13, if the optimal focus distance Z f is chosen 

roughly centered in the interval representing the top of the mesa of the I/z curve, 
producing a Az interval 1302 within the top of the mesa, then the variation in signal 



W 15 intensity AI 1304 over the interval Az is relatively small compared with the AI 

ry interval (1208 in Figure 12) resulting from selection of the focus distance at the peak 

of the I/z curve Z p . 

If, by contrast, the scanner geometry and photo-detection apparatus 
produces a symmetrical, Gaussian : like I/z functional curve, selection of a focus 
20 distance corresponding to the z-position of the curve peak Z p may result in the 

smallest possible signal intensity variation. Figure 14 illustrates the signal intensity 
variation over a z-position variation centered about the peak of the I/z curve. 

Thus, either a focus distance corresponding to a z-position at a positive 
offset from the peak z-position may be selected, for I/z curves having slanted mesa 
25 forms, or a focus distance corresponding to the peak z-position may be optimal for 
symmetrical, Gaussian-like I/z functional relationships. In either case, the optimal 
focus distance is a focus distance that gives a relatively large signal intensity that 
varies relatively slightly with variation in the focus distance. 

The method for optimal focus distance selection that represents one 
30 embodiment of the present invention relies on a single-pass, automated scan of a 
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reference molecular array. The reference molecular array is prepared by precisely 
coating the surface of a molecular array substrate with a polymethylmethacrylate 
polymer ("PMMA") containing a fluorescent or chemiluminescent dye. The PMMA 
polymer can be spun onto the surface of a slide and, if necessary, planarized, using 
5 PMMA-substrate-preparation techniques commonly employed in the manufacture of 
semiconductors. PMMA-based reference arrays are far more uniform than previously 
employed cyanine-dye reference arrays. The U.S. patent application Serial No. 
10/008598 entitled "Devices For Calibrating Optical Scanners And Methods Of 
Using The Same" by Holcomb et al. (filed Dec. 4/01) details the PMMA-based 
10 reference arrays, and is herein incorporated by reference. 
Q Figure 15 illustrates the scan pattern employed in a preferred 

embodiment of the present invention. In Figure 15, a molecular array 1502 is 
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Cj] abstractly represented with horizontal rows, such as row 1504. The scan pattern 

SI 

2 involves scanning a fixed number of contiguous rows n at each z-position over a 

» 

1 5 range of z-positions Z max to Z*^ . The z-position is incremented by an increment / 

O over this range of z-positions. For example, as shown in Figure 15, a contiguous 

block of n rows is first scanned at z-position z = -7z, a second block of n contiguous 
rows is scanned at z-position z = -61 1508, and so on, up through scanning of the 

LJ 

n I continuous block of n rows 1 5 1 0 at z-position z = 7i. The number of rows scanned at 



20 each z-position w, the z-position increment /, and the range Z niax to Z*^ may all be 

specified in a configuration file for the automated focus-distance determination. Note 
that there are many alternative possible focus-distance-determination scan patterns. 
In one embodiment of the present invention, z positions ranging from -20 /J 

(microns) to +20 ju are scanned at 0.25 jj. increments, with ten rows scanned for each 

25 z-position. 

In the following discussion, the described embodiment relates to a 
two-channel molecular array scanner that measures emitted light in red and green 
regions of the visible spectrum. Thus, the following discussion refers to red and 
green channels and red and green signals. However, the present invention is equally 
30 applicable to single-channel molecular array scanners or to molecular array scanners 
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that measure more than two types of signals, or measure signals from different parts 
of the electromagnetic radiation spectrum. 

Figure 16 is a flow-control diagram for the optimal focus-distance 
determination method "focus_distance" that represents one embodiment of the 
present invention. In step 1602, the scanner is initialized. Initialization involves 
checking the scanner to make sure that the scanner is properly physically configured, 
setting the photodetector voltages to default voltages, initializing a stage motion 
controller(s), and loading the reference molecular array into the scanner and 
verifying, via a bar code imprinted on the surface of the molecular array, that the 
reference molecular array is the appropriate reference molecular array for the optimal 
focus-distance determination. In step 1604, the optimal focus-distance determination 
configuration file is input, specifying various parameters for the optimal focus- 
distance determination, including the parameters n 9 i, Z"^ and Z^ K parameters 

described with reference to Figure 15, above. In step 1606, the reference array is 
scanned as specified by the configuration-file parameters input in step 1604. The 
reference-array scan results in a computer encoding of signal intensities measured for 
discrete regions covering the surface of the molecular array. In many molecular array 
scanners, these signal-intensity results are encoded as integer or floating-point values 
associated with pixels within the scanned image of the molecular array. In the for- 
loop comprising steps 1608, 1610, and 1612, focus_distance filters and averages the 
signal intensities for each block of n rows. The rows are filtered to remove saturated 
pixels, or, in other words, pixels with intensity values that exceed the linear range of 
the electronics (due, for example, to dust), and the signal intensities of the remaining 
pixels are averaged to produce an average intensity value corresponding to the z- 
position at which the rows were scanned. Note that, if the number of saturated pixels 
exceeds some specified threshold, the scan may be rejected, and an indication 
displayed by the molecular array scanner to a user indicating that the molecular array 
scanner needs to be reconfigured in order to produce signal intensities within 
appropriate ranges. 

Next, in step 1614, the average intensity values for each z-position are 
assembled into a computer-encoded format corresponding to the I/z graphs shown in 
Figures 11-13 and 14, above. Note that each type of signal, such as the red and green 
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signals described above, in the background section, are measured to produce separate 
I/z functional relationship for each type of signal, or channel. The computer-encoded 
data representing the I/z functional relationship for each channel can then be scanned 
with respect to z-position to determine the / and z coordinates of the peak of the I/z 
curve. In step 1616, focus_distance determines whether the z-positions of the I/z 
peaks for all channels fall within an acceptable range of z-position values. If not, 
then focus distance displays an error message in step 1618 indicating the need for 
resetting the coarse adjustment of the z-component of the autofocus mechanism, or 
another resetting or reconfiguration of the molecular array scanner, and then returns 
an error. If all the peaks fall within an acceptable z-position range, then a focus 
routine is called in step 1620. The focus routine called depends on the geometry and 
configuration of the molecular array scanner. For molecular array scanners that 
produce mesa-shaped I/z functional curves, the routine "window_focus" is called. 
Alternatively, if the molecular array scanner produces symmetrical, Gaussian-like 
curves, as shown in Figure 14, then the focus routine "peakintensityfocus" is 
called. Molecular scanners that produce other types of I/z curves may need other, 
specific focus routines. The focus function called in step 1620 produces an optimal z- 
position corresponding to an optimal focus distance, Z /? that can be used by the 

molecular array scanner for one or more data-acquisition scans. 

Next, in step 1622, focus_distance graphically displays the results of 
the optimal focus-distance determination to a user. If the user wishes they can review 
the displayed information and can input acceptance or rejection of the optimal focus- 
distance determination. If the method is found to be robust enough for any particular 
configuration of scanner, this step can be skipped and the process be fully automated. 
If the optimal focus-distance determination is accepted, as determined in step 1624, 
then the optimal focus distance is stored into non-volatile memory within the array 
scanner in step 1626, and focus_distance returns success. The optimal focus distance 
may be stored in flash memory, for example, or in other non-volatile memory storage, 
such as a mass storage device. If the optimal focus distance is rejected, as determined 
in step 1624, then the user is prompted for input as to whether to retry the optimal 
focus-distance determination. If the user elects not to retry the optimal focus-distance 
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determination then focus_distance returns an error. Otherwise, control flows back to 
the initial step, step 1602. 

For molecular scanners that produce Gaussian-like, roughly 
symmetrical I/z curves, as shown in Figure 14, the focusing routine 
M peak_intensity_focus M is called from step 1620 of focus_distance, described with 
reference to Figure 16. Figures 17-21 illustrate operation of the routine 
"peak_intensity_focus" with reference to I/z functional curves. Figure 22 is a flow- 
control diagram for the routine M peak_intensity_focus." Note that the I/z curves 
shown in Figures 17-21 are mesa-shaped curves characteristic of one class of 
molecular array scanners. Although the routine "peak_intensity_focus" provides best 
results for symmetrical, Gaussian-like Vz curves, it can also be applied to the mesa- 
shaped curves shown in Figures 1 7-2 1 . 

Figure 17 is a graph of the I/z curve for the red channel 1702 and the 
I/z curve for the green channel 1 704 of a two-channel molecular array scanner. The 
routine "peak_intensity_focus" first determines the z-position Z p 1 706 of the peak in 

intensity for the red-channel I/z curve 1702, as shown in Figure 17. Next, as shown 
in Figure 18, peak_intensity_focus begins from the z-position Z'^ and proceeds 

rightward, as shown by arrow 1802, searching for a Z, z-position at which the 

corresponding signal intensity rises to a threshold percentage of the signal intensity 
corresponding to the Z p peak z-position. After finding the z-position Z, , 

peak intensity focus starts at z-position Z p 1804 and proceeds rightward, as 

indicated by arrow 1806, evaluating the signal intensities of successive z-positions 
until peak_intensity_focus identifies a position Z r 1808 at which the corresponding 

signal intensity falls below the threshold value. As shown in Figure 19, the z- 
positions Z, 1 803 and Z r 1 808 define endpoints of a red-channel plateau interval P r 

1902 for the red-channel I/z curve. In similar fashion, peak_intensity_focus applies 
the plateau-finding technique to identify a green-channel plateau interval P g shown 

as interval 1202 in Figure 20. Finally, as shown in Figure 21, peak intensity focus 
determines whether the overlap 2102 between the red-channel plateau interval P r 
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1902 and green-channel plateau interval 2002 is greater than or equal to a 

threshold length, in microns, and determines whether or not each of plateau intervals 
P and P are also greater than or equal to a threshold width, in microns. In one 

embodiment, the threshold width for both the overlap 2102 and the plateau 
5 intervals 1902 and 2002 is four microns. If the overlap and plateau intervals do not 
meet the threshold requirements, then peak_intensity_focus returns an error value. 
Otherwise, in one embodiment, peak_intensity_focus returns a focus Z f 

corresponding to the z-position of the midpoint of the red-channel plateau 
interval 1902. 

10 The routine "peak^intensityfocus" is alternatively described by the 

flow-control diagram of Figure 22. In step 2202, peak_intensity_focus receives the I/z 
for each channel scanned by the molecular array scanner. In the ^br-loop comprising 
steps 2204-2209, peak_intensity_focus determines the plateau interval for each 
L?j channel. In step 2205, peak_intensity_focus determines the z-position Z p 

Q 15 corresponding to the peak intensity for the currently considered channel c. In 

steps 2206 and 2207, peak_intensity_focus determines the Z { and Z r z-positions for 

the plateau interval for the currently considered channel c, as described above with 
fy reference to Figures 18-19. Then, in step 2208, peak_intensity_focus reports and 

stores the plateau interval P c for the currently considered channel c. In step 2209, 

20 peak_intensity_focus determines whether or not the plateau for another channel needs 
to be computed and, if so, control flows back to step 2205. When all the plateau 
interval shave been computed, peak_intensity_focus, in step 2210, determines the 
overlap between the calculated and stored plateau intervals for all channels 
considered in the for-loop comprising steps 2204-2209. In step 2212, 
25 peak_intensity_focus determines whether or not each plateau interval is greater than 
or equal to a threshold value, in one embodiment, 4 microns. If not, then an error is 
returned. Otherwise, in step 2214, peak_intensity_focus determines whether or not 
the overlap, calculated in step 2210, is greater than or equal to a threshold value, in 
one embodiment, also 4 microns. If not, then an error is returned. Otherwise, a focus 
30 z-position Z f is selected in step 2216. In one embodiment, the midpoint of the red- 
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channel plateau interval is selected as the focus Z f . In other embodiments, a 

centroid z-position for the plateau intervals or for the overlap may be selected as the 
focus Z f . In all embodiments, peak intensity focus returns the final selected focus 

Z f in step 2218. 

5 An alternate focus-finding routine is the focus-finding routine 

M window_focus, M also called in step 1620 of the routine focus_distance illustrated in 
Figure 16. The routine "window_focus" is more suitable for the mesa-shaped I/z 
curves characteristic of one class of molecular array scanners, for reasons described 
above with respect to Figures 12 and 13. Figures 23-29 illustrate operation of 
10 window_focus using graphical representations of I/z curves. Figures 30-31 show 
flow-control diagrams that describe operation of windowfocus. 

--■J.ir, 

*W In a first step, as shown in Figure 23, window_focus computes the z- 

SJ positions Z p and Z p 2302 and 2304 for the red-channel I/z curve 1306 and the 

y j green-channel I/z curve 1308, respectively. Figures 23-29 assume a two-channel 

-~ 1 5 molecular array scanner. Then, window focus checks to make sure that these peak 

w 

HI positions Z p and Z p fall within a range of z-position values 1310 and 1312 that 

represents 80 percent of the range Z^ x - Z^ x . If the peak z-positions Z p and Z p 

fall outside this range, an error is returned. 

Next, window focus divides the z-position range Z^ x - Z^ x .into a 

20 number of evenly sized increments. In one embodiment of the present invention, 
each increment represents a l A micron change in the z-position. The window focus 
algorithm places the left edge of a window, or z-position interval, at each discrete z- 
position increment, with the last few increments near Zm ax + omitted, as truncated 
windows would result. Therefore, there nearly as many windows as there are depth 
25 settings. Figure 24 illustrates the windows initially constructed by window focus. A 
first window is defined by vertical lines 2402-2403. A second window is defined by 
vertical dashed lines 2404-2405. A third window is defined by vertical dashed-and- 
crossed lines 2406-2407. 

Initially, window_focus employs wider windows to determine 
30 approximate plateau intervals for each channel. In one embodiment, an initial 8- 
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micron window is used. If the window-based plateau-interval determination fails to 
work for wider windows, it is tried again for narrower windows. Figure 25 
illustrates narrower windows. The plateau-interval determination is illustrated in 
Figures 26 and 27 for red and green channels, respectively. In Figure 26, 

5 windowfocus starts from the Z^ x z-position and proceeds rightward, as indicated 

by arrow 2602, determining a slope for each window. This slope can be calculated as 
the difference between the largest and smallest signal intensity seen inside the 
window expressed as a percentage of maximum intensity for the entire I/z curve 

between Z^ x and Z^ x . Various methods can be used to determine the slope for a 

m 10 window, including applying a differential smoothing operator and then dividing the 

2 difference between the highest and lowest intensity values within the window by the 

length of the window increment. Alternative techniques may also be employed. 
!r | Window focus determines the slope of all initial windows. Window focus looks for 

positions for which the difference between the maximum and minimum intensity 
1 5 values within the window are less than 2% of the maximum intensity found anywhere 
on the entire I/z curve. If window focus finds at least one such window, it will return 
jj the z-position of the center of the window with the smallest intensity variation. If 
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ft? window focus does not find a window with less than a 2% difference, then 

fj i window focus tries again with smaller-sized windows, in one embodiment, 6-micron 

20 windows If window_focus fails to find a window with less than a 2% difference, 
then window focus tries again with 4 micron windows. If window_focus fails to find 
a 4-micron window with less than a 2% difference, then window focus returns the 4- 
micron with the smallest intensity variations, using an RMS-like intensity variation 
measure discussed below. In Figure 26, window 2704 is shown as meeting the 
25 criteria for a plateau window, although, as pointed out above, the slope of the plateau 
in the figures is exaggerated, and thus appears to create an intensity variation greater 
than 2%. 

Figure 27 illustrates determination of the plateau window 2702 for the 
green channel. Thus, by the window-based method, a red-channel plateau 
30 interval 2704 and a green-channel plateau interval 2702 are determined with respect 
to the z-position coordinate axis. As shown in Figures 28A-B, window focus 
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determines an overlap region common to the determined plateau intervals for each 
channel. In Figures 28A-B, the plateau windows for the red channel (2802 and 2804) 
are plotted horizontally in z-position (with respect to the plateau windows for the 
green channel (2806 and 2808). Z axis segments (2810-2813) are plotted above and 
5 below the plateau-window plots. In Figure 28A, the overlap region begins at z- 
position 2814 and extends to z-position 2816. In Figure 28B, the overlap region 
begins at z-position 2818 and extends to z-position 2820. When, as in Figure 28A, 
the overlap region is less than some threshold value, in one embodiment four 
microns, or the signal intensities within the plateau intervals differ from one another 
10 by greater than some threshold amount, in one embodiment 2 percent, then, as shown 

M 

q in Figure 28A, a left boundary Z, 2822 and a right boundary Z r 2824 are chosen as 

y 



the left-most and right-most boundaries of the plateau intervals. By contrast, when 
the width of the overlap region exceeds a threshold value and the signal intensities 
within the plateau intervals differ by less than a threshold amount, then the left 
y J 15 boundary Z, is chosen as the left-most point of the overlap 2818 and the right 
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•«3 . boundary Z r is chosen as the right-most point of the overlap region 2820, as shown 

ru 

in Figure 28B. 
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~L Window focus then centers smaller windows, in one embodiment, 2- 

Hi micorn windows, at each z-position within the left and right boundaries, and searches, 

20 starting from the center, to find a small window with acceptable criteria most close to 
the center of the Z,-to-Z r interval. The criteria used in one embodiment, for two- 
micron narrow windows, is that the signal intensities within the window must differ 
by less than 1 percent. In one embodiment, the search first considers narrow 
window 2902, next considers window 2904, next considers window 2906, next 
25 considers window 2908, and so forth, in a ping-pong like fashion. The first window 
considered that meets the criteria is selected as the focus window, and the z-position 
of the window's center is selected as the focus Z f . 

Figures 30 and 3 1 show flow-control diagrams for the routine 
"window_focus M and the routine M micro_focus," called from the routine 
30 "window_focus." Beginning with Figure 30 in step 3002, window_focus receives the 
I/z data for scans of the reference array in each channel. In step 3004, window_focus 
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determines the z-positions for the signal intensity peaks for each channel, as 
discussed above with reference to Figure 23, and checks to see that they are within 

the central 80-percent interval of the full z-position range Z~^ x to Z* ax . If all the 

peak positions are not within the 80 percent central range, as determined by 
5 window_focus in step 3006, then an error is returned. Otherwise, in the nested for- 
loops comprising steps 3008-3014, window_focus determines the plateau intervals 
for each channel, as described above with reference to Figures 26-28B. Then, as 
discussed with reference to Figure 29, window focus calls the routine "microfocus," 
in step 3024, to compute the z-position corresponding to the optimal focus distance 
3 10 Z f9 returning the optimal focus z-position Z f in step 3026, along with a status value 

~ indicating whether or not the plateau determination carried out in steps 3008-3014 

03 successfully completed. Step 3008 is the first step of the outer for-loop of the nested 

01 

Zf! ybr-loops of steps 3008-3014. The outer ybr-loop is repeated over a series of 

'"•4 

Jp decreasing window sizes, as described with reference to Figures 24-25, in order to 

^ 15 select plateau intervals for each channel scanned by the molecular array scanner. 



C,1 The inner for-loop comprising steps 3009-3011 determines plateau 

jpjl intervals for each channel by the method described above with reference to 

CO Figures 26 and 17. A flow-control diagram for the routine 

o 

fi i "determine_plateau_interval," called from step 3010, is not provided, as the technique 

20 is thoroughly described, above, with reference to Figures 26 and 27. As noted there, 
plateau intervals span z-positions corresponding to the top of a mesa, and are 
identified as windows with essentially flat or nearly flat slopes that follow steeply 
sloped windows that contain the walls of the mesa. 

When the plateau intervals for all channels have been determined for a 
25 current window size, window_focus determines, in step 3011, whether the signal 
variation, or differences, within each plateau interval is less than 2%. If not, then, in 
step 3012, window_focus determines whether or not the current window size is the 
smallest window size allowed for the outer for-loop. If not, then window_focus 
decrements the window size, in step 3013, and control returns to step 2009, where the 
30 inner ybr-loop is again executed. If the smallest window size has been tried, then 
window_focus returns, as the plateau window for each channel, the smallest windows 
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with the least variation in signal intensity, using an RMS-like measure to be discussed 
below. Once the plateau intervals for each channel are found, then, in step 3016, 
window_focus computes the overlap between plateau intervals. If the overlap is 
acceptable, as determined in step 3018, then, as described with reference to 
Figure 28B, above, window_focus sets the left and right boundaries Z/ and 
Z r respectively, to the left and right boundaries of the overlap region, in step 3020. 

Otherwise, as described above with reference to Figure 28A, window focus sets the 
Z ; and Z r boundary z-position values to the left-most z-position and right-most z- 

position of the various plateau intervals, in step 3022. Note that this step may also set 
an error return value to indicate an error condition corresponding to not finding a 

suitable plateau window for at least one channel. Following setting of the Z, and Z r 

boundary z-positions in step 3020 or 3022, window_focus calls the routine 
M micro_focus" in step 3018. 

Figure 31 shows a flow-control diagram for the routine "micro_focus," 
called from step 3018 of Figure 30. In step 3102, microfocus computes the center of 

the Z,-to-Z r interval Z c . Next, in step 3104, micro focus applies the small 

windows to each increment within the Z ; -to-Z r interval, starting with the center of 

the Z 7 -to- Z r interval Z c . In one embodiment of the present invention, 2-micron 

windows are used. In step 3106, micro focus determines whether there are an odd 

number of windows in the Z,-to-Z r z-position range. If so, then in step 3108, 

micro_focus determines whether the center window has acceptable characteristics. 
As discussed above with reference to Figure 29, acceptable characteristics may 
include a variation of signal intensities within the center window less than 1 percent. 
Alternatively, a slope-based method or another method for determining the degree of 
signal-intensity constancy within the window may be employed. If the center 
window is acceptable, then, in step 31 10, micro_focus sets the z-position 

corresponding to the optimal focus distance, Z /5 to the z-position of the center of the 

center window and returns success. Otherwise, in the for-loop comprising 
steps 31 12-3116, micro_focus carries out a ping-pong search of windows, out from 
the central z-position Z c , to find an acceptable window closest to the center Z c . The 
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index i is set to the first window to the right of the center, and the index j is set to 
index the first window to the left of the center. In each iteration of the for-loop> the 
index i is incremented and the index j is decremented by 1 increment, in step 3116. 
In step 3113, micro_focus determines whether the window index by index i is 
acceptable, according to the criteria described above for acceptability of the center 
window in step 3108. If so, then control flows to step 3110, where the z-position 
corresponding to the optimal focus distance Z f is set to the center of the window 

indexed by variable i. Otherwise, if the window indexed by index j is acceptable 
according to the criteria applied to the central window in step 3108, then the z- 
position corresponding to the optimal focus distance is set to the center of the window 
indexed by j, in step 3110. Otherwise, if there are more windows to consider in the 
range Z, -to-Z r , as determined in step 3115, then the indices are incremented and 

decremented in step 3116 and another iteration of the for-loop is carried out. If an 
acceptable window is not found following termination of the for-\oop then, in 
step 3118, micro_focus computes a root-mean-square-like ("RMS-like") value, 

^((max(/ r )-min(/ r ))/I rmax ) 2 +((max(/ g )-min(^))// gmax ) 2 , for each window, 

where max(/ r ) is maximum red-channel intensity for the window, min(/ r ) is the 
minimum red-channel intensity for the window, l m3ix is the maximum red intensity of 
the red-channel I/z function, max(4) is maximum green-channel intensity for the 
window, min(/ g ) is the minimum green-channel intensity for the window, and Igmax is 
the maximum green intensity of the green-channel I/z function. Micro_focus then 
selects the window with the smallest RMS-like value in step 3120. The z-position of 
the center of the selected window is returned as the z-position corresponding to the 

optimal focus distance Z f . 

Although the present invention has been described in terms of a 
particular embodiment, it is not intended that the invention be limited to this 
embodiment. Modifications within the spirit of the invention will be apparent to 
those skilled in the art. For example, many different scan patterns may be used to 
construct the I/z data employed by the automated focus-distance determination 
routines. Different ranges of window sizes may be employed, and different 
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orderings in window searching may be employed. The automated focus-distance 
determination routines can be implemented in many different languages, or 
hardware circuits, in an almost limidess number of ways, using different modular 
organizations and control logic. The methods was generally described in flow- 
control diagrams to include an arbitrary number of channels, and is intended to be 
applicable for 1 -channel, 2-channel, and many -channel molecular array scanners. 

The foregoing description, for purposes of explanation, used specific 
nomenclature to provide a thorough understanding of the invention. However, it will 
be apparent to one skilled in the art that the specific details are not required in order 
to practice the invention. The foregoing descriptions of specific embodiments of the 
present invention are presented for purpose of illustration and description. They are 
not intended to be exhaustive or to limit the invention to the precise forms disclosed. 
Obviously many modifications and variations are possible in view of the above 
teachings. The embodiments are shown and described in order to best explain the 
principles of the invention and its practical applications, to thereby enable others 
skilled in the art to best utilize the invention and various embodiments with various 
modifications as are suited to the particular use contemplated. It is intended that the 
scope of the invention be defined by the following claims and their equivalents: 



